An Associative Model of Word Use

نویسندگان

  • Scott A. Waterman
  • James Pustejovsky
  • Richard Alterman
  • David L. Waltz
  • Branimir Boguraev
چکیده

An Associative Model of Word Use Scott A. Waterman A dissertation submitted to the Department of Computer Science and the Graduate Faculty of Brandeis University in partial fulfillment of the requirements for the degree of Doctor of Philosophy I describe a model of word usage in context that uses the co-occurrence of words with syntactic environments as evidence of the underlying grammatical constraints that license the combinations. Using these associations between words and contexts as a surrogate for their intrinsic grammatical properties enables us to reason about license, restriction, and preference without needing a detailed theory of these phenomena. The syntactic behavior of words can be defined in terms of their licensed occurrence within the space of contexts. Distributional similarity allows us to use observations of word/context co-occurrence to compare the syntactic behavior of words throughout the language. Various measures of structural similarity are introduced to enable us to also compare the intrinsic structural properties of the contexts. By allowing comparisons between similar-but-not-equal contexts, the structural similarity measure increases the breadth of events over which syntactic behavior between words can be compared. Previous studies of distributional similarity are limited by the inability to compare non-equal contexts. This introduction of a measure over the space of contexts also allows us to reduce the sparse nature of lexical co-occurrence data, and to extend the domain of the model to instances not observed during training. This is accomplished by inductively building a class model of the contexts, based on the structural similarity measure. The resulting context classes are shown to be locally coherent, and good predictors of word usage. The reduced representation of the context space then allows us to determine the intrinsic similarity of words, by enabling the comparison of their co-occurrences across the reduced space.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Associative Model of Word Use a Dissertation Proposal

I present an automatic method for distinguishing the various modes of use of words by categorizing context cues, and describe a model of word usage that characterizes the various syntactic contexts in which words appear without resort to a traditional grammar or part of speech labels. The model is built inductively by identifying patterns of word conngurations through observation of large text ...

متن کامل

More Naturalistic Cross-situational Word Learning

Previous research has found that people can use word-object co-occurrences from ambiguous situations to learn word meanings (e.g., Yu & Smith, 2007). However, most studies of cross-situational learning present an equal number of words and objects, which may simplify the problem by, for example, encouraging learners to use assumptions such as each word going with one object. This paper presents ...

متن کامل

Word Recognition and Learning based on Associative Memories and Hidden Markov Models

A word recognition architecture based on a network of neural associative memories and hidden Markov models has been developed. The input stream, composed of subword-units like wordinternal triphones consisting of diphones and triphones, is provided to the network of neural associative memories by hidden Markov models. The word recognition network derives words from this input stream. The archit...

متن کامل

Actively Learning Object Names Across Ambiguous Situations

Previous research shows that people can use the co-occurrence of words and objects in ambiguous situations (i.e., containing multiple words and objects) to learn word meanings during a brief passive training period (Yu & Smith, 2007). However, learners in the world are not completely passive but can affect how their environment is structured by moving their heads, eyes, and even objects. These ...

متن کامل

Large-scale Associative Memory for Word-Sense Disambiguation

In this paper we describe how an associative neural network can be used for word sense disambiguation. The model is implemented as a lexical transfer system for English to Japanese machine translation and trained on a parallel bilingual corpus. Key advantages of the approach are its ability to naturally and e ciently model context for large-scale language processing with a self-organizing knowl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000